Biologically plausible saliency mechanisms improve feedforward object recognition
نویسندگان
چکیده
The biological plausibility of statistical inference and learning, tuned to the statistics of natural images, is investigated. It is shown that a rich family of statistical decision rules, confidence measures, and risk estimates, can be implemented with the computations attributed to the standard neurophysiological model of V1. In particular, different statistical quantities can be computed through simple re-arrangement of lateral divisive connections, non-linearities, and pooling. It is then shown that a number of proposals for the measurement of visual saliency can be implemented in a biologically plausible manner, through such re-arrangements. This enables the implementation of biologically plausible feedforward object recognition networks that include explicit saliency models. The potential of combined attention and recognition is illustrated by replacing the first layer of the HMAX architecture with a saliency network. Various saliency measures are compared, to investigate whether (1) saliency can substantially benefit visual recognition and (2) the benefits depend on the specific saliency mechanisms implemented. Experimental evaluation shows that saliency does indeed enhance recognition, but the gains are not independent of the saliency mechanisms. Best results are obtained with top-down mechanisms that equate saliency to classification confidence.
منابع مشابه
Object recognition with hierarchical discriminant saliency networks
The benefits of integrating attention and object recognition are investigated. While attention is frequently modeled as a pre-processor for recognition, we investigate the hypothesis that attention is an intrinsic component of recognition and vice-versa. This hypothesis is tested with a recognition model, the hierarchical discriminant saliency network (HDSN), whose layers are top-down saliency ...
متن کاملA model of proto-object based saliency
Organisms use the process of selective attention to optimally allocate their computational resources to the instantaneously most relevant subsets of a visual scene, ensuring that they can parse the scene in real time. Many models of bottom-up attentional selection assume that elementary image features, like intensity, color and orientation, attract attention. Gestalt psychologists, however, arg...
متن کاملShetty, Sanketh v. a Biologically Plausible Architecture for Shape Recogni- Tion. (under the Direction of Professor a Biologically Plausible Architecture for Shape Recognition
SHETTY, SANKETH V. A Biologically Plausible Architecture for Shape Recognition. (Under the direction of Professor Wesley E. Snyder). This thesis develops an algorithm for shape representation and matching. The algorithm is an object centered, boundary-based method for shape recognition. Global features of the shape are utilized to define a frame of reference relative to which local shape featur...
متن کاملNIMBLER: A Model of Visual Attention and Object Recognition With a Biologically Plausible Retina
NIMBLE is a cognitively plausible object recognition system that uses a saccadic visual memory to store and retrieve image fragments. These fragments are acquired by scanning an image in a human-like way using a bottom up saliency model to find informative regions, applying a kernel density function to the fragment to determine its familiarity, and then combining the fragments using naïve Bayes...
متن کاملVisual saliency computations: mechanisms, constraints, and the effect of feedback.
The primate visual system continuously selects spatial proscribed regions, features or objects for further processing. These selection mechanisms--collectively termed selective visual attention--are guided by intrinsic, bottom-up and by task-dependent, top-down signals. While much psychophysical research has shown that overt and covert attention is partially allocated based on saliency-driven e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Vision Research
دوره 50 شماره
صفحات -
تاریخ انتشار 2010